Abstract: In this paper, an improved thresholding approach
based on neutrosophic sets (NSs) and adaptive thresholding
is proposed. It is applied to images of degraded historical
documents and its performance is evaluated. The input RGB image
is transformed into the NS domain, which is described using
three subsets, namely the percentage of truth in a subset, the
percentage of indeterminacy in a subset, and the percentage of
falsity in a subset. The entropy in NS is employed to evaluate
the indeterminacy, with a λ-mean operation used to minimize
it. Finally, the historical document image is binarized
using an adaptive thresholding technique. Experimental
results demonstrate that the proposed approach is able to select
appropriate image thresholds automatically and effectively, while
it is shown to be less sensitive to noise and to perform better
compared with other binarization algorithms.

Keywords: image binarization, thresholding, historical manuscript 
image, neutrosophic theory. 

I. Introduction 

Ancient Arabic documents typically suffer from various
degradations due to both ageing and uncontrolled environmental
conditions [1]. The main artefacts encountered in digitally
captured images of historical documents are [1], [2]: shadows,
non-uniform illumination, smear, stains, bleed-through and
faint characters. Fig. 1 illustrates some examples of degraded
historical Arabic manuscript images. These degradations arise
either from the physical storage conditions of the original
manuscript, or from writers having used different quantities
of ink and pressure, resulting in characters with different
intensities and/or thicknesses as well as faint characters [1].
In addition, some documents contain extra details such as
diacritics or decorations, or have writing in multiple colors.

Binarization of such document images is typically an essential
pre-processing task, and hence important for document
analysis systems. It converts an image into bi-level form in
such a way that the foreground (text) information is represented
by black pixels and the background by white ones [3].
Although document image binarization has been studied for
many years, thresholding of historical document images is still
a challenging problem due to the complexity of the images and
the above-mentioned degradations [2].

Neutrosophic set (NS) approaches are relatively new and
have been applied to various image processing tasks such as
thresholding, segmentation and denoising [4]. In this paper,
a new hybrid algorithm for binarization of degraded Arabic
manuscript images is proposed, which modifies the previous
algorithm of [5] to work in an adaptive manner. Experimental
results demonstrate that the proposed approach is able to select
appropriate image thresholds automatically and effectively,
while it is shown to be less sensitive to noise and to perform
better compared with other binarization algorithms.

Fig. 1. Examples of manuscript images containing multi-colored text lines
with different degradations.

The remainder of the paper is structured as follows: Sections
II and III present related work on image binarization
and neutrosophic sets. In Section IV, the proposed method for
binarization of historical document images is presented. Section
V gives experimental results, while Section VI concludes
the paper.

II. Document Image Binarization 

Approaches for document image binarization can be
grouped into two main categories: global and local approaches.
Global algorithms select a single threshold value for the entire
image. This gives good results if there is a good separation
between foreground and background. However, for historical
documents, this approach is not robust enough [3]. To deal
with degradations, the current trend is to use local information
that guides the threshold value, often pixel-wise, in an adaptive
manner [6], [7]. Hybrid methods [3] combine global and local
information to assign pixels to one of the two classes (text or
background).

Otsu’s thresholding algorithm [8] is the most widely employed
global method. It tries to find the threshold t which
separates the gray-level histogram, in an optimal way, into two
segments. It maximizes the inter-class variance and minimizes
the intra-class variance, with both variances calculated from
the normalized histogram of the image
$H = [h_0, \ldots, h_{255}]$, where $\sum_i h(i) = 1$. The
inter-class variance is given by

$$\sigma^2_{inter} = q_1(t) \times q_2(t) \times [\mu_1(t) - \mu_2(t)]^2, \tag{1}$$

where

$$q_1(t) = \sum_{i=0}^{t} h(i), \tag{2}$$

$$q_2(t) = \sum_{i=t+1}^{255} h(i), \tag{3}$$

$$\mu_1(t) = \frac{1}{q_1(t)} \sum_{i=0}^{t} h(i) \times i, \tag{4}$$

and

$$\mu_2(t) = \frac{1}{q_2(t)} \sum_{i=t+1}^{255} h(i) \times i. \tag{5}$$
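As an illustration of the global approach, the following minimal Python sketch searches for the threshold that maximizes the inter-class variance of Eq. (1), using the class probabilities and class means of the two histogram segments. The function name `otsu_threshold` and the toy histogram are assumptions made for this example and are not taken from [8].

```python
def otsu_threshold(hist):
    """Return the threshold t that maximises the inter-class variance.

    `hist` is a list of 256 non-negative counts, one per gray level.
    """
    total = float(sum(hist))
    h = [c / total for c in hist]              # normalised histogram, sums to 1
    mean_all = sum(i * p for i, p in enumerate(h))

    best_t, best_var = 0, -1.0
    q1 = 0.0   # probability of the first class (levels 0..t)
    s1 = 0.0   # first moment of the first class
    for t in range(len(h) - 1):
        q1 += h[t]
        s1 += t * h[t]
        q2 = 1.0 - q1
        if q1 <= 0.0 or q2 <= 0.0:
            continue                            # one class is empty
        mu1 = s1 / q1
        mu2 = (mean_all - s1) / q2
        var_inter = q1 * q2 * (mu1 - mu2) ** 2
        if var_inter > best_var:
            best_t, best_var = t, var_inter
    return best_t


# Toy bimodal histogram: dark text near level 40, bright paper near level 200.
hist = [0] * 256
for level in (38, 40, 42):
    hist[level] = 100
for level in (198, 200, 202):
    hist[level] = 300
t = otsu_threshold(hist)
```

For this histogram any threshold between the two modes gives the same variance, so the returned value lies in the gap between the dark and bright levels.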
Niblack’s algorithm [9] calculates a pixelwise threshold in a
sliding window fashion. The threshold t is computed by using
the mean $\mu$ and standard deviation $\sigma$ of all the pixels in the
window, and is derived as

$$t = \mu + k \times \sigma, \tag{6}$$

where k is a constant in [0, 1] that determines how much of
the total print object edge is retained.
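A minimal sketch of this sliding-window rule is given below. It assumes dark text on a bright background, clips the window at the image border, and uses k = 0.2 and a 3 × 3 window purely for illustration; none of these choices are prescribed by [9], and the helper names are hypothetical.

```python
import math


def local_stats(img, i, j, w):
    """Mean and standard deviation over the w x w window centred on (i, j).

    The window is clipped at the image border (a choice made for this sketch).
    """
    r = w // 2
    rows, cols = len(img), len(img[0])
    vals = [img[m][n]
            for m in range(max(0, i - r), min(rows, i + r + 1))
            for n in range(max(0, j - r), min(cols, j + r + 1))]
    mu = sum(vals) / len(vals)
    sigma = math.sqrt(sum((v - mu) ** 2 for v in vals) / len(vals))
    return mu, sigma


def niblack_binarize(img, w=3, k=0.2):
    """Mark a pixel as text (0) if it is darker than t = mu + k * sigma."""
    out = []
    for i in range(len(img)):
        row = []
        for j in range(len(img[0])):
            mu, sigma = local_stats(img, i, j, w)
            t = mu + k * sigma                  # threshold of Eq. (6)
            row.append(0 if img[i][j] < t else 255)
        out.append(row)
    return out


# Bright page (200) with a one-pixel-wide dark stroke (30) in the middle column.
page = [[200, 200, 30, 200, 200] for _ in range(5)]
binary = niblack_binarize(page)
```

On this noise-free toy page only the stroke column is labelled as text; on real scans, flat background windows have a threshold close to the local mean, which is one reason the method tends to amplify background noise.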

Sauvola’s algorithm [10] is a modification of Niblack’s
approach claimed to give improved performance on documents
where the background contains light texture, big variations and
uneven illumination. A threshold is computed based on the
dynamic range R of the standard deviation as

$$t = \mu \times (1 + k(\sigma/R - 1)), \tag{7}$$

where k is a fixed value. According to [11], this method is
shown to be more effective than Niblack’s algorithm when
the gray-level of the text is close to 0 and that of background
close to 255. However, in images where the gray-levels of
background and text pixels are close, the results are
unsatisfactory.
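The effect of Eq. (7) can be checked with a short worked example. The sketch below evaluates the threshold for a flat background window and for a high-contrast window; the values k = 0.5 and R = 128 are commonly used defaults and are assumptions here, as the text does not state them.

```python
def sauvola_threshold(mu, sigma, k=0.5, R=128.0):
    """Threshold of Eq. (7) for a window with mean mu and standard deviation sigma."""
    return mu * (1.0 + k * (sigma / R - 1.0))


# Flat bright background: sigma = 0, so t = mu * (1 - k) = 100.
t_flat = sauvola_threshold(200.0, 0.0)

# Window straddling a stroke: sigma / R = 0.625, so t = 143 * 0.8125 = 116.1875.
t_edge = sauvola_threshold(143.0, 80.0)
```

In a flat window the threshold falls well below the local mean, so background pixels are not labelled as text. This also suggests why the method struggles when text and background gray levels are close: a faint stroke may stay above the lowered threshold.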

III. Neutrosophic Theory 

Neutrosophic theory considers an event, concept, or entity A
in relation to its opposite Anti-A and neutrality Neut-A, which
is neither A nor Anti-A. Neut-A and Anti-A are referred to as
Non-A. Every idea A tends to be neutralized and balanced by
Anti-A and Non-A. For example, if A = “white”, then
Anti-A = “black”, Non-A = “blue, yellow, red, black, etc.” (any
color except white), and Neut-A = “blue, yellow, red, etc.”
(any color except white and black) [12], [13].

A. Neutrosophic Set 

Let U be a universe of discourse, and A a neutrosophic set
included in U. An element x in the set A is denoted
as x(T, I, F). T, I and F are real standard or non-standard
subsets of $]^{-}0, 1^{+}[$ with sup T = t_sup, inf T = t_inf,
sup I = i_sup, inf I = i_inf, sup F = f_sup, inf F = f_inf,
n_sup = t_sup + i_sup + f_sup, and
n_inf = t_inf + i_inf + f_inf. T, I and F are referred to
as neutrosophic components.

x(T, I, F) belongs to A in the following way: it is t% true
in the set, i% indeterminate, and f% false, where t varies in T,
i varies in I, and f varies in F. Viewed statically, T, I and F
are subsets, while viewed dynamically they are functions/operators
depending on known or unknown parameters. T, I and F are not
necessarily intervals, and may be discrete or continuous,
single-element, finite, or countably infinite, unions or
intersections of various subsets, etc. They may also overlap.

B. Neutrosophic Image 

Let U be a universe of discourse, and W be a set included in
U, which is composed of bright pixels. A neutrosophic image
$P_{NS}$ is characterized by three subsets T, I and F. A pixel P
in the image is described as P(T, I, F) and belongs to W in
the following way: it is t% true in the bright pixel set, i%
indeterminate, and f% false. The pixel P(i,j) in the image
domain is transformed into the neutrosophic domain by

$$P_{NS}(i,j) = \{T(i,j), I(i,j), F(i,j)\}, \tag{8}$$

where T(i,j), I(i,j) and F(i,j) are the probabilities of
belonging to the bright, indeterminate and non-bright set,
respectively, which are defined as

$$T(i,j) = \frac{g(i,j) - g_{min}}{g_{max} - g_{min}}, \tag{9}$$

$$F(i,j) = 1 - T(i,j), \tag{10}$$

$$I(i,j) = 1 - \frac{Ho(i,j) - Ho_{min}}{Ho_{max} - Ho_{min}}, \tag{11}$$

$$Ho(i,j) = |e(i,j)|, \tag{12}$$

where Ho(i,j) is the homogeneity value of T at (i,j), which
is described by the local gradient value e(i,j); g(i,j) is the
intensity value of the pixel P(i,j); and $g_{min}$ and $g_{max}$
are the minimum and maximum values of g(i,j), respectively.
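The transformation can be sketched in a few lines of Python. This is a minimal illustration and not the implementation of [5]: it takes the homogeneity as the magnitude of a central-difference gradient with replicated borders, which is an assumption, since the text does not specify how the local gradient is computed. The function name `to_neutrosophic` is hypothetical.

```python
import math


def to_neutrosophic(g):
    """Map a gray image g (list of rows) to the subsets T, I and F."""
    rows, cols = len(g), len(g[0])

    def px(i, j):
        # Replicate border pixels so the gradient is defined everywhere.
        return g[min(max(i, 0), rows - 1)][min(max(j, 0), cols - 1)]

    g_min = min(min(row) for row in g)
    g_max = max(max(row) for row in g)
    g_span = (g_max - g_min) or 1.0            # guard against a flat image

    # Homogeneity Ho = |e|, with e approximated by central differences
    # (an assumption of this sketch).
    ho = [[math.hypot((px(i, j + 1) - px(i, j - 1)) / 2.0,
                      (px(i + 1, j) - px(i - 1, j)) / 2.0)
           for j in range(cols)] for i in range(rows)]
    ho_min = min(min(row) for row in ho)
    ho_max = max(max(row) for row in ho)
    ho_span = (ho_max - ho_min) or 1.0

    T = [[(g[i][j] - g_min) / g_span for j in range(cols)] for i in range(rows)]
    F = [[1.0 - t for t in row] for row in T]
    I = [[1.0 - (ho[i][j] - ho_min) / ho_span for j in range(cols)]
         for i in range(rows)]
    return T, I, F


# Left half dark, right half bright: a single vertical edge.
img = [[0, 0, 255, 255] for _ in range(4)]
T, I, F = to_neutrosophic(img)
```

For this image T is 0 on the dark half and 1 on the bright half, F is its complement, and I takes its extreme values on the flat regions and next to the edge, following the "1 −" form of the indeterminacy definition.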

C. Neutrosophic Image Entropy 

Entropy is utilized to evaluate the distribution of different
gray levels in an image. If the entropy is maximal, the
different intensities have equal probability and are thus
distributed uniformly. On the other hand, if the entropy
is small, the intensities have different probabilities and their
distribution is non-uniform.

Neutrosophic image entropy is defined as the summation of
the entropies of the three subsets T, I and F, and is employed
to evaluate the distribution of the elements in the NS domain:

$$En_{NS} = En_T + En_I + En_F, \tag{13}$$

with

$$En_T = -\sum_{i} p_T(i) \ln p_T(i), \tag{14}$$

$$En_I = -\sum_{i} p_I(i) \ln p_I(i), \tag{15}$$

$$En_F = -\sum_{i} p_F(i) \ln p_F(i), \tag{16}$$

where $En_T$, $En_I$ and $En_F$ are the entropies of subsets T,
I and F, respectively, and $p_T(i)$, $p_I(i)$ and $p_F(i)$ are the
probabilities of element i in T, I and F, respectively. $En_T$ and
$En_F$ are utilized to measure the distribution of the elements
in the neutrosophic set, and $En_I$ is employed to evaluate the
indetermination distribution.
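A minimal sketch of this entropy computation follows. It assumes the Shannon form with the natural logarithm and quantizes each subset, whose values lie in [0, 1], into 256 levels before estimating probabilities; both the quantization and the function names are assumptions made for illustration.

```python
import math
from collections import Counter


def subset_entropy(values, levels=256):
    """Shannon entropy (natural log) of a subset with values in [0, 1]."""
    flat = [min(int(v * levels), levels - 1) for row in values for v in row]
    n = len(flat)
    return -sum((c / n) * math.log(c / n) for c in Counter(flat).values())


def neutrosophic_entropy(T, I, F):
    """Sum of the entropies of the three subsets."""
    return subset_entropy(T) + subset_entropy(I) + subset_entropy(F)


# Two equally likely levels in T and F, a constant I.
T = [[0.0, 1.0]]
I = [[0.5, 0.5]]
F = [[1.0, 0.0]]
en_ns = neutrosophic_entropy(T, I, F)
```

Here T and F each contribute ln 2, while the constant indeterminate subset contributes zero, so the total is 2 ln 2.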
Fig. 2. Overview of the proposed document analysis approach.



D. λ-mean Operation

As mentioned, the value of I(i,j) is employed to measure
the indeterminate degree of $P_{NS}(i,j)$. To make I
correlated with T and F, changes in T and F should influence the
distribution of elements in I and the entropy of I. In the gray
level domain, a λ-mean operation for image X can be defined
as

$$\bar{X}(i,j) = \frac{1}{w \times w} \sum_{m=i-w/2}^{i+w/2} \sum_{n=j-w/2}^{j+w/2} X(m,n), \tag{17}$$

where w is the local window size.

A λ-mean operation for $P_{NS}$ is defined as

$$\bar{P}_{NS}(\lambda) = P(\bar{T}(\lambda), \bar{I}(\lambda), \bar{F}(\lambda)), \tag{18}$$

with

[Equations (19) and (20) are missing from the source text.]

and

$$\bar{I}_{\lambda}(i,j) = 1 - \frac{\bar{H}o(i,j) - \bar{H}o_{min}}{\bar{H}o_{max} - \bar{H}o_{min}}, \tag{21}$$

where $\bar{H}o(i,j)$ is the homogeneity value of $\bar{T}(\lambda)$ at (i,j).

After the true subset T has been processed using the λ-mean
operation, noise in T is removed and T becomes more
homogeneous. Consequently, T can be segmented precisely
even with a simple thresholding method.
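The smoothing effect on T can be illustrated with the local mean of Eq. (17). The sketch below rests on an assumption: it replaces T(i,j) by its local mean only where the indeterminacy I(i,j) is at least λ, which is the gating commonly used in the neutrosophic image processing literature but is not spelled out in the text above. The window is clipped at the image border, and λ = 0.85 is an arbitrary illustrative value.

```python
def local_mean(X, i, j, w):
    """Mean of X over the w x w window centred on (i, j), clipped at borders."""
    r = w // 2
    rows, cols = len(X), len(X[0])
    vals = [X[m][n]
            for m in range(max(0, i - r), min(rows, i + r + 1))
            for n in range(max(0, j - r), min(cols, j + r + 1))]
    return sum(vals) / len(vals)


def lambda_mean(T, I, lam=0.85, w=3):
    """Smooth T only at pixels whose indeterminacy is at least lam (assumed rule)."""
    return [[local_mean(T, i, j, w) if I[i][j] >= lam else T[i][j]
             for j in range(len(T[0]))]
            for i in range(len(T))]


# A bright region with one noisy pixel that is flagged as highly indeterminate.
T = [[1.0, 1.0, 1.0],
     [1.0, 0.1, 1.0],
     [1.0, 1.0, 1.0]]
I = [[0.0, 0.0, 0.0],
     [0.0, 0.9, 0.0],
     [0.0, 0.0, 0.0]]
T_bar = lambda_mean(T, I)
```

Only the flagged centre pixel changes: it is pulled from 0.1 to the window mean of 0.9, while the determinate pixels keep their values.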



IV. Proposed Approach

Fig. 2 summarises the various steps of the proposed algorithm.
Captured manuscript images, which are RGB images $P_{RGB}$
(of different sizes and stored in simple bitmap format), are
converted to gray-level images $P_g$ using the NTSC standard
method.

A. Pre-processing

Since historical document images are usually of low quality,
a pre-processing stage is essential in order to eliminate noise
areas, smooth the background texture and better highlight the
contrast between background and text areas [14]. The use of a
low-pass Wiener filter has proved efficient in this context [15],
with the window size of the filter selected according to the
minimum character line width [14]. In our method, the window
size is selected as 3 × 3. The filtered gray-level image $P_{gw}$
can then be binarized in the next stage.

B. Binarization

This is the stage where we employ a modified version
of the neutrosophic thresholding method of [5]. The filtered
gray image $P_{gw}$ is transformed into the neutrosophic domain,
giving $P_{NS}$ as described in Section III-B. Then, the λ-mean
operation is employed to reduce the indetermination degree
of the image $P_{NS}$, which is evaluated by the entropy of the
indeterminate subset. Thus, the image becomes more uniform
and homogeneous and more suitable for thresholding. We then
use the adaptive thresholding method of [10] to obtain the
binary image.

C. Post-processing

Adaptive thresholding produces a noisy binary image.
Consequently, a post-processing stage is required to remove the
noise. For this, a median filter with a window size of
3 × 3 is used to enhance the binary image.


D. Summary 

Our proposed document analysis algorithm is summarized 
in Algorithm 1. 
Algorithm 1 Proposed document analysis algorithm

1: Read in RGB image $P_{RGB}(x,y)$.
2: Convert to gray image $P_g(x,y)$ using NTSC standard.
3: Apply Wiener adaptive filter as pre-processing step to
   obtain $P_{gw}(x,y)$.
4: Transform $P_{gw}(x,y)$ into neutrosophic domain to obtain
   $P_{NS}(x,y) = \{T(x,y), I(x,y), F(x,y)\}$ according to the
   entropy of $P_{gw}(x,y)$ and its mean.
5: Measure the entropies of the three subsets T, I, and F.
6: Apply a λ-mean operation on $P_{NS}(x,y)$ to decrease its
   indetermination.
7: Segment the true subset T using an adaptive thresholding
   technique.
8: Apply a median filter to remove noise as post-processing
   step.
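The first and last image operations of Algorithm 1 are simple enough to sketch directly. The Python fragment below illustrates step 2 with the NTSC luma weights (0.299, 0.587, 0.114) and step 8 with a 3 × 3 median filter; the replicated border handling and the function names are assumptions of this sketch, and the Wiener filtering and neutrosophic steps are omitted.

```python
def rgb_to_gray(rgb):
    """Step 2: apply the NTSC luma weights to each (R, G, B) pixel."""
    return [[0.299 * r + 0.587 * g + 0.114 * b for (r, g, b) in row]
            for row in rgb]


def median_filter(img, w=3):
    """Step 8: w x w median filter; border pixels are replicated."""
    rows, cols = len(img), len(img[0])
    r = w // 2
    out = []
    for i in range(rows):
        out_row = []
        for j in range(cols):
            window = sorted(
                img[min(max(m, 0), rows - 1)][min(max(n, 0), cols - 1)]
                for m in range(i - r, i + r + 1)
                for n in range(j - r, j + r + 1))
            out_row.append(window[len(window) // 2])
        out.append(out_row)
    return out


gray = rgb_to_gray([[(255, 255, 255), (0, 0, 0)]])

# Binary image with one isolated black pixel (speckle noise) on a white page.
noisy = [[255] * 5 for _ in range(5)]
noisy[2][2] = 0
clean = median_filter(noisy)
```

The isolated speckle disappears because it is outvoted by its eight white neighbours. The same mechanism can also erase genuinely small marks such as diacritics, which is worth keeping in mind for Arabic text.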



V. Experimental Results 

In this section, the proposed method is evaluated and 
compared with other binarization methods from the literature. 

Our dataset contains samples collected from both the 
database of [2] and from the electronic Arabic manuscripts 
of [16]. In our evaluation, we focus on images that have 
several degradations such as multi-colored text lines, stains in 
the background, degraded characters and marks, and character 
diacritics. 

For evaluation of our results, ground truth images are 
generated using a similar method to the recent work of [17]. 

We compare the performance of our proposed algorithm
with that of other binarization methods, namely
Otsu’s [8], Niblack’s [9], Sauvola’s [10], and Guo’s [5]. The
latter uses a similar neutrosophic approach, but the binary
image is obtained using global thresholding.

Fig. 3 demonstrates the results obtained for an example
image of the dataset. As can be seen, our algorithm outperforms
the other approaches in terms of preservation of meaningful
textual information. The other methods either fail to segment the
foreground text, especially in the region of stains (Fig. 3(d),
(e), and (f)), or segment foreground from background but
add excessive noise (Fig. 3(f)). On the other hand, our final
binary image suffers from some stroke-like pattern noise
(SPN), which arises because the Arabic manuscripts used include
diacritics. SPN [18], [19] is similar to diacritics, and hence its
presence near textual components may change the meaning of
a word.

Fig. 3. Example results obtained by the different binarization algorithms.

For the images of our dataset, the average processing time
for our method was about 0.4 seconds for images of an average
size of 500 × 500 (on an Intel Core i3-2310 CPU @ 2.10 GHz
with 3 GB RAM, running Windows 7, with the algorithm
implemented in Matlab R2009a).

Several methods have been presented for the evaluation of
document image binarization techniques, and can be classified
into three main categories [1]. Evaluation can be performed
by visual inspection by one or more human evaluators. Here,
symbols that are broken or blurred, and loss of objects as
well as noise in background and foreground are used as visual
evaluation criteria. Clearly, this approach is time consuming
and subjective. Evaluation based on optical character
recognition (OCR) performance applies OCR to the result image
and uses the obtained character or word recognition accuracy.
Applying OCR as an evaluation criterion is not possible for
our experiments due to the lack of efficient commercial
software for recognizing handwritten Arabic manuscript
writing [20]. Finally, direct evaluation of the binarization
can be performed by taking into account the pixel-to-pixel
correspondence between the ground truth and the binarized
image. For this, several measures can be employed [21]-[23],
of which we use the following in our tests:

• F-measure [24], defined as

$$F = \frac{2 \times Precision \times Recall}{Precision + Recall}, \tag{22}$$

where

$$Recall = \frac{TP}{TP + FN}, \tag{23}$$

and

$$Precision = \frac{TP}{TP + FP}. \tag{24}$$

Here, TP denotes true positives, TN true negatives, FP
false positives, and FN false negatives.

• $F_\beta$-measure, defined as the weighted harmonic mean
of precision and recall

$$F_\beta = \frac{(1 + \beta^2) \times TP}{(1 + \beta^2) \times TP + \beta^2 \times FN + FP}. \tag{25}$$

• PSNR, a similarity measure between two images, defined
as [21], [25]

$$PSNR = 10 \log_{10}\left(\frac{C^2}{MSE}\right), \tag{26}$$

with

$$MSE = \frac{\sum_{m,n} [I_1(m,n) - I_2(m,n)]^2}{M \times N}, \tag{27}$$

where $I_1$ and $I_2$ are the two images, M and N the
image dimensions, and C the peak signal value (the difference
between foreground and background intensities).

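• Negative rate metric (NRM), which is based on pixel-wise
mismatches between the ground truth and the binarized
image [26], and combines the false negative rate
and the false positive rate as

$$NRM = \frac{1}{2}\left(\frac{N_{FN}}{N_{FN} + N_{TP}} + \frac{N_{FP}}{N_{FP} + N_{TN}}\right), \tag{28}$$

where $N_{TP}$, $N_{FP}$, $N_{TN}$ and $N_{FN}$ denote the numbers
of true positives, false positives, true negatives and false
negatives, respectively. A better binarization quality is
characterised by a lower NRM value.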

• Distance reciprocal distortion (DRD) metric, defined as

$$DRD = \frac{\sum_{k} DRD_k}{NUBN}, \tag{29}$$

where $DRD_k$ is the distortion of the k-th flipped pixel
and is calculated using a 5 × 5 normalized weight matrix
$W_{Nm}$ [29]. $DRD_k$ equals the weighted sum of pixels in
the 5 × 5 block of the ground truth GT that differ from
the centered k-th flipped pixel at (x, y) in the binarization
result B:

$$DRD_k = \sum_{i=-2}^{2} \sum_{j=-2}^{2} |GT_k(i,j) - B_k(i,j)| \times W_{Nm}(i,j). \tag{30}$$

NUBN is the number of non-uniform (not all black
or all white pixels) 8 × 8 blocks in the GT image [27].
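The pixel-based measures above can be computed from the four confusion counts. The following Python sketch is illustrative only and makes several assumptions that should be stated clearly: text pixels are encoded as 0 and background as 255, PSNR is evaluated on images scaled to {0, 1} so that the peak value is 1, NRM is taken as the mean of the false negative and false positive rates, and DRD is omitted for brevity. All function names are hypothetical.

```python
import math


def confusion_counts(gt, result, text=0):
    """Count TP, FP, TN, FN with text pixels treated as the positive class."""
    tp = fp = tn = fn = 0
    for gt_row, res_row in zip(gt, result):
        for g, r in zip(gt_row, res_row):
            if r == text and g == text:
                tp += 1
            elif r == text:
                fp += 1
            elif g == text:
                fn += 1
            else:
                tn += 1
    return tp, fp, tn, fn


def f_beta(tp, fp, fn, beta=1.0):
    """Weighted harmonic mean of precision and recall; beta = 1 gives F."""
    b2 = beta * beta
    return (1 + b2) * tp / ((1 + b2) * tp + b2 * fn + fp)


def psnr(fp, fn, n_pixels):
    """PSNR for {0, 1}-scaled binary images, where MSE is the error fraction."""
    mse = (fp + fn) / n_pixels
    return float("inf") if mse == 0 else 10 * math.log10(1.0 / mse)


def nrm(tp, fp, tn, fn):
    """Mean of the false negative rate and the false positive rate."""
    return (fn / (fn + tp) + fp / (fp + tn)) / 2


gt = [[0, 0, 255, 255],
      [0, 0, 255, 255]]
result = [[0, 255, 255, 255],
          [0, 0, 0, 255]]
tp, fp, tn, fn = confusion_counts(gt, result)
f = f_beta(tp, fp, fn)
p = psnr(fp, fn, 8)
n = nrm(tp, fp, tn, fn)
```

In this toy case one text pixel is missed and one background pixel is wrongly marked, so precision and recall are both 3/4, giving F = 0.75, NRM = 0.25 and an MSE of 0.25.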

Table I summarizes the results (averages over all dataset 
images) of the various binarization algorithms. As is clear from 
the obtained measures, our proposed approach provides the 
best results with respect to all performance metrics. 



TABLE I
Performance comparison of all binarization methods

Method      F       F_β     PSNR    NRM    DRD
Niblack     84.47   88.63   13.51   4.05   0.072
Otsu        90.36   90.72   15.88   2.65   0.037
Sauvola     94.76   95.34   20.96   1.20   0.033
Guo         90.44   90.89   16.53   2.60   0.037
Proposed    95.59   95.87   23.57   1.01   0.028


VI. Conclusions 

In this paper, we have proposed a hybrid thresholding technique
for degraded images of historical Arabic manuscripts.
Our method combines a neutrosophic set approach with an
adaptive thresholding method. Experimental results show that
the proposed method provides good binarization performance
for complex images that contain various challenges including
stains, ink seeping, and characters written in different ink
colors. Future work will focus on removal of stroke-like
pattern noise to further improve the binarization results.